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SEPARATION CUT-OFFS FOR BIRTH 
AND DEATH CHAINS 

By Persi Diaconis 1 and Laurent Saloff-Coste 2 
Stanford University and Cornell University 

This paper gives a necessary and sufficient condition for a se- 
quence of birth and death chains to converge abruptly to stationar- 
ity, that is, to present a cut-off. The condition involves the notions of 
spectral gap and mixing time. Y. Peres has observed that for many 
families of Markov chains, there is a cut-off if and only if the prod- 
uct of spectral gap and mixing time tends to infinity. We establish 
this for arbitrary birth and death chains in continuous time when the 
convergence is measured in separation and the chains all start at 0. 

1. Introduction. Some ergodic Markov chains show a sharp transition in 
convergence to stationarity. This was first observed for random transposi- 
tions on the symmetric group in [22]. The phenomenon was clearly identified 
in [2] where the term "cut-off phenomenon" was introduced (see [2], Figure 
2). Recently, Yuval Peres observed that for many examples, a cut-off occurs 
if and only if the product At tends to infinity where A is the spectral gap 
(i.e., 1 minus the second largest eigenvalue) and r is the mixing time (i.e., 
the first time the distance to stationarity is less than 1/4). 

Our main theorem proves a precise version of this statement for all fi- 
nite continuous-time birth and death chains started at when convergence 
is measured in separation distance. Namely, for each n, let 7* denote the 
distribution at time t > of a given ergodic birth and death chain on 
= {0, 1, . . . , n}, started at 0. Let v n be the corresponding stationary mea- 
sure. The separation between 7* and its target v n is 

Sep(7n> l/ n) = SU P I 1 - ln{ x )/ v n{x)}. 
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Let A n .i € [0,2], i = 0, ...,n, be the eigenvalues, in nondecreasing order, of 
minus the associated Q-matrix Q n (i.e., the matrix representing the infinites- 
imal generator of the associated Markov process). We always have A nj o = 0, 
and \ n = X Ut i is called the spectral gap. Set 

n 
1 

Theorem 1.1. Referring to the setting and notation introduced above, 
we have: 

1. // X n t n tends to infinity with n, then, for any c E (0, 1) , 

lim sep(7^ 1+c )*" , u n ) = 0, lim sep^ 1 "^' 1 , u n ) = 1. 

n — >oo n — >oc 

2. // X n t n does not tend to infinity with n, then, for any c £ (0, 1), 

limsupsep(7^ 1+c) * n ,^ n ) > 0, liminf sep(7^ 1_c)t ™ , v n ) < 1. 

n — >oo n 

See Theorem 5.1 below. A detailed analysis of the cut-off window is also 
obtained. The proof uses the duality theory of [16] to convert convergence 
rates into first hitting time estimates and Keilson's representation of first 
hitting times as sums of independent exponentials with parameters related 
to the spectrum of the chain. 

The paper is organized as follows. Section 2 discusses various distances 
and carefully defines the cut-off phenomenon. Section 3 gathers elementary 
remarks concerning the cut-off phenomenon in separation and total vari- 
ation distance. Birth and death chains are introduced in Section 4 which 
reviews duality and Keilson's spectral representation of hitting times. The 
main results — Theorems 5.1 and 5.2 — are stated and proved in Section 5. 
They provide a characterization of the cut-off phenomenon for continuous- 
time birth and death chains started at (in discrete time, the result is 
restricted to chains satisfying a certain monotonicity condition). Section 6 
gives, when it exists, a precise description of the shape of the separation 
cut-off. This shape may or may not be Gaussian. It is Gaussian if and only 
if the size of the window is of an order of magnitude strictly larger than the 
relaxation time 1/A (i.e., the inverse of the spectral gap). Section 7 gives 
detailed examples comparing cut-offs in separation, total variation and L 2 - 
distance. These examples includes simple random walks, Metropolis chains, 
the Bernoulli-Laplace and Erhrenfest chains and simple random walk on 
distance-transitive graphs. 
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2. Distances and cut-offs. Assume that to any finite set £1 and any pair 
of probability measures fx, v on f2 is associated a real number D{^,,v) such 
that D(fj,,v) £ [0,1], 

supsupD(/x, v) = 1 

and D(fi, v) = if and only if /x = v (in some cases, the supremum in Q in 
the equation displayed above might be necessary to attain the supremum of 
the possible values taken by D). Examples of interest are the total variation 
distance 

D(/j,, v) = - ^ ||tv = SU P - u(A) 

Acn 



and separation 



D([i, v) = sep(/i, u) = max 



Note that separation is not symmetric and is not a distance between prob- 
ability measures. Separation was introduced in [2, 3] in the context of the 
study of convergence of ergodic finite Markov chains. 

Consider a sequence of (finite) probability spaces (f2 n ,i> n ), n = 1,2,..., 
each equipped with a sequence of probability measures //^, k = 0,1,2, ... , 
such that 

lim D(ifi,v n ) = 0. 

K— >0O 

Definition 2.1. A family {Q, n ,v n , (fJ,n)k=o,i,...)n=i,2,... presents a cut-off 
(more precisely, a -D-cut-off ) if there exists a sequence (t n ) of positive reals 
such that for any e 6 (0, 1), 

(a) lirrin^oo D(^ n , f n ) = if /c n > (1 + e)t n for all sufficiently large n, 

(b) linin^oo D(n^ , v n ) = 1 if fc n < (1 — e)t n for all sufficiently large n. 

The next definition introduces the notion of window size for the cut-off 
phenomenon: 

Definition 2.2. Given sequences (t n ) and (b n ) of positive reals, we say 
that the family 



l,-/n=l,2,. 



presents a (t n , 6 n )-cut-off [more precisely, a (t n , b n )-D-cut-o&] if b n /t n tends 
to zero and 

(a) /+(c) = limsup^^ D(^n n+ , i/ n ) satisfies lim^oo / + (c) = 0, 
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(b) /_ (c) = lim infc^oo D(fj^ n cbn ^ , u n ) satisfies Hindoo /_ (c) = 1 . 

Both definitions can be interpreted in an obvious way when the discrete 
family of measure k = 0, 1, . . . , is replaced by a continuous family i/ n , 
t>0 (in this case, /+ and /_ are defined using t n + cb n without rounding to 
the next or previous integer). Versions of these definitions were introduced 
in [3, 15] in the case where the measures /x^, k = 0, 1, . . . , are the marginals 
of a Markov chain on Q n with stationary probability u n . See also [4] and 
[34], Section 2.4. 

Remarks. 1. For simplicity, we have restricted attention to the case 
where the maximum of the "distance" D is 1. However, Definitions 2.1 and 
2.2 can easily be extended to the case where D is unbounded, for example, 
the L 2 distance 

\ 1/2 



(2.1) D(fi,u) 



v{uj) 



In this case, in part (b) of each of the two definitions above, simply replace 
the limit value 1 by oo (see, e.g., [34], Section 2.4). 

2. Note that the definitions above do not require that the sequence t n 
tends to oo (this condition is required in [15, 34], but probably for no good 
reason. Thanks to Jim Fill for pointing this out to us). For instance, let 
£l n = {1, . . . ,n}, K n (x,y) = l/(n — 1) for all y ^ x G Q n and u n = 1/n. This 
family presents a (l,e n ) total variation cut-off for any sequence e n tending 
to 0. Indeed, \\K% — ^ n ||TV = V( n — 1) ? whereas \\K® — f n ||TV = {n—l)/n. 

3. If a family (/x^) with continuous parameter t has both a (t n )-cut-off 
and an (s n )-cut-off, then s n ~ t n (i.e., linin^oo s n /t n = 1). This is also true 
for a discrete-time family if one of t n or s n tends to infinity. However, for 
a discrete-time family having both a (t n ) and an (s n )-cut-off, the best that 
can be said in general is that the limit points of the sequence \t n — s n \ all 
belong to the interval [0,1]. Because of this, cut-off sequences that do not 
tend to infinity have to be treated with some special care in discrete time. 

Examples of finite Markov chain cut-offs are discussed in [15] which poses 
the following questions. How widespread is the cut-off phenomenon for fam- 
ilies of finite ergodic Markov chains and how can one recognize it? 

It has been suggested by Yuval Peres that in some generality, these ques- 
tions could be answered simply in terms of two parameters, namely, the 
L>-mixing time 

(2.2) r n D = r n D (e) = inf{k : D(ji k n , u n ) < e} 

and an appropriately defined notion of spectral gap. Here e is a small fixed 
parameter (e.g., one often picks e = 1/4). In the special case when the chain 
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is reversible (hence diagonalizable with real eigenvalues in [—1,1]), set A n = 
1 — p n , where (3 n is the second largest eigenvalue. Peres' suggestion is that a 
.D-cut-off occurs (say, in continuous time) if and only if the quantity \ n T® 
tends to oo. 

It turns out that the condition X n r^ tends to oo is indeed a necessary and 
sufficient condition for a cut-off to occur when D is the L 2 distance defined 
in (2.1) (and also when L 2 is replaced by LP with 1 < p < oo). This is proved 
in [10, 11]. Earlier results in this direction are described in [34]. This means 
that when working with the L 2 distance (2.1), one can often assert that a 
cut-off exists without necessarily having to or being able to give a precise 
asymptotic of the cut-off time. 

Examples due to David Aldous show that if D is total variation, the con- 
dition \ n Tn — ► oo does not necessarily imply a cut-off (see, e.g., [10] where 
a version of Aldous' example is presented). Still, it is natural to conjecture 
that the condition that X n T^ tends to infinity is sufficient under additional 
assumptions, for example, for random walks on finite groups based on small 
generating sets or for birth and death chains. 

In this paper, we consider the case of continuous-time birth and death 
chains on {0, 1, . . . ,m n } started at 0. When D is separation, we show that 
there is a cut-off if and only if X n Tn tends to infinity. This will follow from 
previous work of Diaconis and Fill [16], who produced optimal strong sta- 
tionary times through the construction of strong stationary duals, and works 
of Keilson [29] and Brown and Shao [8] linking spectral data to first passage 
times. 

3. Remarks on total variation and separation cut-offs. For n = 1, 2, . . . , 

let Q n be a finite set equipped with a Markov kernel K n (x, y) with stationary 
probability v n . Fix a starting point x n G £l n and consider the sequence of 
probability measures /j, n , k = 0,1, . . . , where fi^ is the distribution of the 
associated Markov chain started at x n after k steps. 

Let D stand for either the total variation distance or separation. One 
thing these two notions have in common is that given the data above, there 
exists a sequence of real nonnegative random variables having a useful 
interpretation and such that 



When D is total variation, the T®'s are "optimal coupling times," whereas 
when D is separation the T^^s are "optimal strong stationary times" (see [3, 
30] and the references therein). Let t n and a\ be, respectively, the mean and 
variance of the random variable . By a well-known form of Chebyshev's 
inequality (e.g., [24], (7.5), page 152), for all a > 0, we have 



D(jj*,v n ) = P(T?>k). 



(3.1) P{T» > t n + aa n ) < 



1 



P(T» <t n - aa n ) < 



1 



l + a 



2 ' 



l + a 
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Prom these facts we can draw the following conclusions: 

(a) For any e 6 (0, 1), the mixing time T®{e) defined in (2.2) satisfies 

t n - (e- 1 - \y l l 2 o n < r°(e) <t n + (e- 1 - lfl 2 a n . 

(b) If there is a constant c > such that ct n > a n and there is cut-off 
at time s n with lim n ^oo s n = oo, then s n ~ t n . When working in continuous 
time, the conclusion s n ~ t n holds true without having to assume that s n 
tends to infinity. 

(c) If o"" 1 ^ — > oo, then there is a (i n , cr n )-L>-cut-off. 

The upper bound in part (a) follows from the first inequality in (3.1) by 
solving (1 + a 2 ) -1 = e. The lower bound follows from the second inequality 
in (3.1) by solving (1 + a 2 ) -1 = 1 — e. 

Part (b) requires a little work and we treat only the continuous-time 
case. Assume there is a cut-off at time s n and fix rj,e 6 (0, 1). Then, for n 
sufficiently large, we must have 

(l-^n<T n D (e)<(l + 7 ? ) Sn . 

Setting e = (1 +rj 1 )~ 1 and using the first bound in (a), we obtain 

(1 - rj)s n <t n + i]a n < (1 + cq)t n . 
Using e = (1 + rj~ 2 )~ l and the second bound in (a) gives 

(1 - crf)t n <t n - r]a n < (1 + r])s n . 

This shows that s n ~ t n , as desired. 

In general, little is known about the times T^, so these remarks have 
only theoretical value. In particular, we know of no nontrivial cases where 
an optimal coupling time has been constructed in a useful way. In contrast, 
there are several known examples of optimal strong stationary times to which 
the remarks above apply (e.g., the top to random and riffle shuffles discussed 
in [14] and the geometric walks in [31]). 

In this context, the challenge posed by Peres' question is to relate the 
condition cr~ 1 t n — > oo to spectral information. When that can be done, the 
remarks above may yield useful results. This will be illustrated below. 

4. Separation for birth and death chains. Let Q = {0, .. . ,m}. A birth 
and death chain is a Markov chain K on VL such that K(x,y) = unless 
\x — y\ < 1. Write 

q x = K(x,x - 1), x = l,...,m, 
r x =K(x,x), x = 0,...,m, 

p x = K(x,x + l), x = 0, ...,m — 1 
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and, by convention, qo = p m = 0. We will assume throughout that the chain 
is irreducible, that is, that q x > for < x < m and p x > for < x < in. 
Such chains have stationary probability 

u(x)=cf[&=± t 

y=l % 

where c = z/(0) is a normalizing constant. Birth and death chains are in fact 
reversible, that is, they satisfy 

v{x)K(x,y) = u(y)K(y,x). 

It follows that the operator K : L 2 (Q,u) — > L 2 (J7,z/) defined by / i— ► Kf = 
^2 y K (• , y) f \y) is self-adjoint and thus diagonalizable with real eigenvalues 
in [—1,1]. Let Aj, i = 0, . . . ,m, be the eigenvalues of / — K in nondecreasing 
order (/ denotes the identity operator). Thus, Ao = < Ai < A2 < • • • < A m < 
2. The irreducibility of the chain is reflected in the fact that Ai > 0. It is 
also well known that A m = 2 if and only if the chain is periodic (of period 
2) which is the case if and only if r x = for all x. In fact, because we are 
dealing here with irreducible birth and death chains, it is known that the 
Aj's are all distinct (e.g., [8, 29]). Karlin and McGregor [27, 28] observed 
that the spectral analysis of any given birth and death chain can be treated 
as an orthogonal polynomial problem. This sometimes leads to the exact 
computation of the spectrum (see, e.g., [26, 27, 28, 36] and also [32] for a 
somewhat different approach based on continued fractions). 

Given a birth and death chain as above, let /x fc be its distribution after k 
steps starting at 0. Let 7* be the distribution at time t > of the associated 
continuous-time process started at 0, that is, 



In [16], Diaconis and Fill construct what they call a strong stationary 
dual for any discrete-time birth and death chain satisfying the condition 
Px + Qx+i < 1, < x < m, (such chains are called monotone chains). The 
dual chain is a birth and death chain with the same eigenvalues as the 
original chain. The first passage time at the extremity m for that dual chain 
is a strong stationary time for the original chain. The first passage time 
distribution is explicitly computed by Keilson and by Brown and Shao [8, 29] 
in terms of the spectral data. Fill [25] treats continuous-time chains (the 
condition p x + q x +\ < 1, < x < m, is not needed in that case). These works 
give the following result: 

Theorem 4.1 ([16, 25]). Let K be an irreducible birth and death chain 
as above. 
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(a) For the associated continuous-time process started at 0, we have 

(b) For the discrete-time chain, assuming that p x + q x +i < 1 3 < x < m, 
we have 

sep(„V) = ^{l - <£|} = £ n 3^(1 - A,)'- 

— — ^ V I ' i=l j^i 1 3 

Although these are beautiful formulas, it is not so obvious how to use them 
to derive explicit bounds. However, (a) has a very clear interpretation: it says 
that separation at time t is the tail of a sum of m independent exponential 
random variables with respective parameters Aj, 1 < i < m. Similarly, when 
all A, are in [0, 1] , (b) says that separation at time k is the tail of a sum 
of independent geometric random variables with respective parameters Xi, 
1 < i < m. In particular, we have the following obvious corollary: 

Corollary 4.2. Let K be an irreducible birth and death chain as above. 



(a) For the associated continuous-time process started at 0, we have 

sep( 7 V)= max (l - 2^2} = P(T > t) 

0<x<m { v(x) J 

where T = Y^i Si, each Si being an exponential random variable with pa- 
rameter Xi and the S^s being independent. In particular, 

m m 

E(T)=J2K\ Var(T) = £Ar 2 . 
l l 

(b) For the discrete-time chain, assuming that p x + q x +i < 1, < x < m, 
we have 

sepGu fc ,z/)= max (l-^$-\ = P(T>k), 

0<x<m{ v(x) J 

where T is a random variable with 

m m 

E(T) = £ A" 1 , Var(T) = £(1 - A,)A" 2 . 
l l 

The random variable T can be written as a sum T = J2T Si where the random 
variables Si, 1 < i < m, are independent and Si is geometric with probability 
of success Xi if Xi £ (0, 1], whereas Si is a Bernoulli variable with parameter 
A," 1 ifXi>l. 



CUT-OFFS FOR BIRTH AND DEATH CHAINS 



9 



Remarks. 1. The times Si have no known interpretations in terms of 
the underlying birth and death chain. 

2. Of course, the same results apply if the birth and death chain starts at 
the other extremity m. As the spectral data does not change, it follows that 
sep(7*(0, •), u), that is, the separation starting from 0, and sep(7 (m, •), u), 
that is, the separation starting from m, are equal at all times! This is in sharp 
contrast with what happens in total variation distance, for which starting 
at one or the other extremity can lead to very different behaviors. 

3. In view of the above results, and from the viewpoint developed in the 
next few sections, it is interesting to note that for any set of m distinct 
positive numbers < Ai < ■ • ■ < A m < 1, there is a birth and death chain as 
above with eigenvalues (0, Ai, . . . , A m ); see [33] and the references therein. 

4. Recall that, given an arbitrary positive distribution u on {0, 1, . . . , to}, 
the Metropolis algorithm based on symmetric random walk on the path 
{0,...,m} produces an irreducible birth and death chain having v as its 
stationary distribution. Thus, any positive distribution can occur as the 
stationary distribution of an irreducible birth and death chain. 

5. Continuous-time Markov chains offer the freedom to choose a time 
scale. Starting with a Markov kernel K(x,y) on a countable space, we can 
consider the continuous-time Markov chain generated by the K — I, where 
I denotes the identity matrix. Starting at x, this continuous-time chain has 
probability distribution at time t given by 

t n 

However, the most general and natural definition of a countable continuous- 
time Markov chain involves a matrix Q(x,y) (representing the generator) 
satisfying J2 y Q( x ^y) = an d Q(x,y) > if x 7^ y. In that generality, the 
quantity ^2 y ^ x Q{x,y) does not have to be uniformly bounded (and ex- 
plosion in finite time is possible). On a finite state space, we can always 
set q = max x {— Q(x, x)} and consider the (discrete-time) chain with ker- 
nel K(x,y) = I(x,y) + q~ 1 Q(x,y) where / is the identity matrix. Letting 
7g(x, •) be the probability distribution of the continuous-time Markov chain 
with generator Q started at x, we have 

1 t Q (x,.)=^ t (x,-). 

Let us now consider a family of continuous-time (finite state space) ergodic 
Markov chains (f2 n , lQ n ( x n,-),^n) and consider whether or not this family 
presents a D-cut-off. The answer to this question is independent of the 
chosen time scale. Indeed, using the notation introduced above, 



lQ n { X m-)=i q n\xn,-)- 
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It follows that 7^(x n , •), v n ) presents a Z)-cut-off at time t n [resp. a 
.D-cut-off of type (t n ,b n )] if and only if (Q n ,'jQ n (x n , -),v n ) presents a D- 
cut-off at time t n /q n [resp. a -D-cut-off of type (t n /q n ,b n /q n )]. This remark 
would not be valid if we had required that (t n ) must tend to infinity in the 
definition of a cut-off. 



5. Separation cut-off for birth and death chains. We now describe what 
the previous section entails concerning the cut-off phenomenon. For n = 
1,2, ... , let £l n = {0, 1, . . . , m n } be equipped with an irreducible birth and 
death chain K n having stationary measure v n . Let q n ,x, r n,x and p n ,x be the 
corresponding transition probabilities. 

Let //^ be the distribution of the associated chain at time k started at 0. 
Let 7^ be the distribution of the continuous-time process at time t started 
at 0. Let A nj j G [0, 2], < i < m n , be the corresponding eigenvalues. Set 

m„ 
l 

Finally, for any e E (0, 1), consider the separation mixing time 
T n (e) = M{t : sep(7* , v n ) < e}. 

Theorem 5.1. Referring to the setting and notation introduced above, 
the family 

(£l n ,V n , (7n)*>o) n =l,2,... 

has a separation cut-off if and only if N n = \ n t n tends to infinity. For any 
c > 0, the separation bounds 

(5.1) sep( 7 i 1+c) Sz/ ra )< 1 , se p( 7 (i-^ n ) > l 1 

1 + c z N n 1 + c A N n 

always hold and for any fixed e G (0, 1), i/ie condition \ n t n — > oo is equivalent 
to X n T n (e) -> oo. 

Proof. By Corollary 4.2, we have 

sep(/4,i/ n ) =P(T n >t), 
where T n has mean t n and variance Var(T^) — cr^ satisfying 

1 1 

^ \i 2 X] ^ n >* = 
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Here we have simply used the fact that A„/A n .j < 1 to obtain the middle 
inequality. Hence, we have 

(5.2) a n < t n and, better, a n < -/V~ 1//2 t n ,. 

The separation bounds (5.1) follow directly from (5.2) and the Chebyshev 
inequalities (3.1). 

Assume that N n = X n t n — > oo. By the second inequality in (5.2), it follows 
that t n /a n — > oo. By (c) of Section 3, there is a separation cut-off at time t n 
and, even better, a (t n , cr n )-cut-off. Conversely, if there is a cut-off at time 
s n , then by (5.2) and (b) of Section 3, we must have s n ~ t n and there must 
be a cut-off at time t n . By (5.1), this implies that N n tends to infinity. 

Now, fix e £ (0, 1). By the upper bound in (a) of Section 3 and the first 
inequality in (5.2), we have 

(5.3) r n {e) <t n + (a" 1 - l^Vn < (1 + (e" 1 - l)^ 2 )t n . 

Hence A n r n (e) — ► oo implies \ n t n — > oo. Conversely, if t n \ n — > oo, then there 
is a cut-off at time t n and by (5.2) and (a) of Section 3, t n ~ T n (e). It follows 
that A n r n (e) — > oo. This ends the proof of Theorem 5.1. □ 

Remarks. 1. Theorem 5.1 shows that for continuous-time birth and 
death chains started at 0, a separation cut-off can occur only if m n tends to 
infinity. 

2. For D and (O ra , u n , (^n)k=o,i,...) as in Definition 2.1, we say that there 
is a D-precut-off at time s n if there are constants 0<c<l<C<oo such 
that 

0, if k n > Cs n , 



lim D(n* n ,v n ) 



1, if k n cs n . 



Obvious modifications apply in continuous time. Theorem 5.1 shows that 
there cannot be a separation precut-off if X n t n is bounded. Hence, for con- 
tinuous birth and death chains started at 0, the existence of a separation 
precut-off is equivalent to the existence of a separation cut-off. 

3. In (5.1), we can replace N n = \ n t n by N' n = A n r n (e) as follows. The 
bound (5.3) and the first inequality in (5.2) give N' n < (1 + (e~ l - l) 1/2 )N n . 
Hence, for c > and e E (0, 1), we have 

sep( 7 ( 1+c ) i ",^)< 



and 



l + c 2 (l + (e- 1 -l) 1 /2)-iiv^ 

sep( 7 ( 1 - c )*",i/ n )>l 



1 + C 2 (1 + (£-!- l) 1 ^)-!^/- 

The same remark applies to (6.3) below which improves upon (5.1). 
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The next result is the discrete-time version of Theorem 5.1. It requires 
the "monotonicity" assumption p x + q x +i < 1- The proof is similar to that 
of Theorem 4.2(b) and is therefore omitted. 

Theorem 5.2. Referring to the setting and notation introduced above, 
assume that for each n and each x S {0, . . . , m n — 1}, we have 



has a separation cut-off if and only if N n = \ n t n tends to infinity. 

The so-called monotonicity condition p n ^ x + q n , x +i < 1 easily implies that 
r n,o > 0, thus ensuring aperiodicity. It is, however, a little surprising that 
negative eigenvalues of K (i.e., 1 — Aj with A, > 1) play no role whatsoever in 
Theorem 5.2. As in the continuous case, for any fixed e G (0, 1), the theorem 
above can be stated using r n (e) = infj/c : sep(/U^, v n ) < e} instead of t n . 

6. The shape of the cut-off. When a cut-off is determined, say at time 
s n , the next task is to look at the window size. If one is able to establish an 
(s n , b n ) -cut-off [possibly adjusting the sequence (s n )], then the question of 
the optimality of the window size b n is posed. One way to answer this ques- 
tion is to obtain the shape of the cut-off, that is, to determine the functions 
f± of Definition 2.2. If / + and 1 — /_ are nonzero in a neighborhood of oo, 
then the sequence (b n ) is optimal and the functions f± describe the shape of 
the cut-off. Only a small number of such results have been established (see, 
e.g., [15]). In the cases of interest to us in this paper, Corollary 4.2 easily 
allows us to obtain the shape of the cut-off. 

Theorem 6.1. Referring to a family of birth and death chains as in 
Section 5 and using the notation introduced there, assume that N n = \ n t n — ► 
oo and set 



Pn,x + Qn,x+1 — 1- 



Then the family 



n)k= 0,1,... ) 



n=l,2,... 



^2 Kl- 



in 




1 



(a) Assume that \ n o~ n — > oo. Then for any real c, 



n 



lim sep(7, 



n 



71 





In particular, there is a (t n ,o~ n )-cut-off, but no (t n ,X, 



)- cut- off. 
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(b) Assume that X n a n is bounded. Then there is a (t n ,o~ n ) -cut-off [equiv- 
alently a (t n , A" 1 ) -cut-off] and for any real c> 0, we have 

lim inf sep(7* n+CCT ™ , v n ) > 0, 

n — >oo 

whereas for any real c < 0, 

limsupsep(7*" +c<T ",i/„) < 1. 

n— >oo 

Proof. We have sep(7^,i/ n ) = P(T n > t). Consider the moment gener- 
ating function 

M n (t) = E(e t{T "- tn)/cTn ). 

As T n is a sum of m n independent exponential random variables with re- 
spective parameters A nj j, i = 1, . . . , m n , we have 



M n (t) = e- tt "/^ n / = eF " (t) > 



where 

rn~ on , , 

Jk{n) ^ 
bft 

i=l k=2 ' 

with 



F n (t) = -t n a~H - £log(l - X-}a-H) = £ -^7^ 



9 k (n) = J2(K,i/Xn,i) k - 

i=l 

As A n> i/A ni j < 1, we have 1 < 9k(n) < 62(n),k > 2. Hence, the series above 
converges, at least for t S (—1,1) and 

(6.1) 0<F„( t )- t V2<E M2W(t _ 2)/2 . 

If X n a n = 02{n) 1 / 2 — > oo, then M n (t) tends to e* 2 / 2 for any real t and it 
follows that a~ l {T n — t n ) is asymptotically distributed as a standard normal 
random variable; see [12]. This proves part (a) of Theorem 6.1. 
Assume now that X n a n is bounded, say X n a n < A. Hence, 

A" 1 < o- n < AX" 1 

and, for any k = 2,3, ... , 

1 < k (n) < A. 

Obviously (e.g., by Chebyshev's inequality), the distributions of a~ l {T n — 
t n ), n = 1,2,..., form a tight family. Given any subsequence (nj), we can ex- 
tract from it a subsequence (rij e ) such that along that subsequence, P(T n > 
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t n + ca n ) converges to P{T > c), c£l, for some random variable T. Now, 
from the previous moment generating function computation, it follows that 
along (rij e ), the limit 



lim 9 k (n je )=9 k e [1,A] 

l-*oo 

exists for each k > 2 and T has moment generating function 
(6-2) ex P(? + E^T72) 

V 2 tusker) 

for all t G (— O^ 2 , 0^ 2 ). As the variables a~ l {T n — t n ) are infinitely divisible, 
T is also infinitely divisible (see, e.g., [35] for properties of infinitely divisible 
distributions). By (6.2), the normal component of T is nontrivial and it 
follows that T has a smooth positive density. Obviously, this implies part 
(b) of Theorem 6.1. Note that no limit points of the sequence (T n — t n )/a n 
can be normal. This ends the proof of Theorem 6.1. □ 

Let us observe that the first part of Theorem 6.1(b), that is, the fact that 
for any real c> 0, 

lim inf sep(7* n+c<Tn , v n ) > 0, 

n— >oo 

can be proved by a very elementary argument. To bound P(T n > t n + ca n ) 
from below, write 

P(T n >t n + ca n ) 

(m n mn \ 

Sn,l > A" 1 + (C + lK;^ S n ,i > E K} - °n 
i=2 i=2 J 

(m n m„ \ 

E 5 n,i>E A nj-^ • 
i=2 i=2 ) 

By the second inequality in (3.1), and the fact that > Var(2^2 Sn,i)> we 
have 

(m n m„ \ 

E^>E A nj-^n >i 
i=2 i=2 / 

It follows that 

p( Tn > tn + C(Jn ) > le-^^fc+ljTnjA,, > i e -(i+( c+ i)A)_ 

In contrast, the inequality concerning the lower tail seems harder to prove 
without the sophisticated tools of infinitely divisible distributions. 
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Remarks. 1. In part (b) of Theorem 6.1, assume further that for each 

k, 

/A \ k 
9 k = lim — ^ < oo 

exists. Then for any c£K, we have 

lim sep(7*" +CCT ", v n ) = 1 - F(c), 

n — >oo 

where F(t) is the distribution function of an infinitely divisible law whose 
moment generating function is given in (— 6 2 ,9 2 ) by (6.2). In particu- 
lar, < F(c) < 1 for all cGR. For instance, the Bernoulli-Laplace example 
in Section 7 has a nonnormal cut-off shape in separation. 

2. It follows from (6.1) that J E( e ± ( T "-'™)/( 2<T ")) < e 1 / 4 . Hence, for all c> 0, 

P(T n >t n + ca n ) < e"( c - 1 /2)/2 ) p(Tn >tn _ CGn) > 1 _ e -(c-i/2)/2 
and thus 

sep( 7 (1+c)i ' l ,^)<e-( c7V "- 1 / 2 )/ 2 , 

(6.3) 

sep ( 7 (l-c)t„ ; ^ ) > 1 _ e -(ciV n -l/2)/2 ) 

where N n = \ n t n tends to infinity with n. 

7. Examples. This section illustrates our results by looking at various 
explicit (and not so explicit) families of birth and death chains. 

Simple random walk. Consider the simple random walk on {0, . . . , n} 
with rQ = r n = p = pj = qj = q n = 1/2, j £ {1, ... ,n—l} . We refer the reader 
to [24], XVI. 3, for the spectral information used below. We have 

Anj = l-cos — — , j = 0,...,n. 
n+1 

As X n j > (j/(n + l)) 2 (and this is optimal, up to a multiplicative constant), 
we see that A" 1 = A~ \, t n = J2 an d °~n = (S A~p^ 2 are all of order n 2 . 
By Theorems 5.1 and 5.2, there is no separation cut-off (either in continuous 
or discrete time). Of course, this is well known! 

Suppose, instead, that r n = pj = p, < j < n — 1 and qj = tq = q, 1 < j < 
n, p + q = 1, < q < p < 1. Then the eigenvalues are 

Anj = 1 — 2\/PQ cos — — , j = 0,...,n. 
n+1 

Hence, if p and q are fixed with < q < p < 1, we have X n = 1 — 2^/pq, 
whereas t n = J2i^n]j ^ s greater than n/2 and a n is greater than \Jnj2. 
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Theorems 5.1, 5.2 and 6.1 prove the existence of a (t n ,a n ) separation cut-off 
with a normal shape. Observe that 

f 1 -l 1 

t n = an + 0(l), a= (1 ~~ 2 A /pgcos(7rx)) dx = —== 

Jo v 1 - 



and 



a 2 n = bn + 0(1), b= (1 — 2 A /pgcos(7rx)) 2 c?x. 

JO 

Note that the window of the separation cut-off is not given by A^ 1 ~ 1 in 
this case. 

Using diagonalization, one finds that (starting at 0) this chain has a 
(2cn, 1)-L°° cut-off and a (era, 1)-L 2 cut-off with 

\og(p/q) 



2(1 -2y/pq)' 
It is a calculus exercise to check that c > a, that is, 

2(1 -2y/m) 



log(p/?) > 



Indeed, writing p = (1 + u)/2, q = (1 — u)/2, it G (0, 1), we get 1 — 4pg = u 2 
and the above inequality boils down to 



log(l + it) - log( l - u) 1 - VT 



2u 

which holds true for u € (0, 1) because the left-hand side is at least 1, whereas 
the right-hand side is at most 1. Thus, the L 2 -cut-off occurs later than the 
separation cut-off (this is not always true — there are many examples where 
the separation cut-off is twice the L 2 -cut-off ) . 

Diaconis and Fill [16], Example 4.46, and Belsley [6], Chapter V, study 
various versions of this chain in detail (in discrete time) and show that there 
is a cut-off in total variation at time an with an optimal window of size y/n. 
The fact that the total variation cut-off is the same as the separation cut-off 
can be explained as follows. If the chain starts from the top point ra, it is not 
hard to use the available spectral information to show that it converges in 
a constant number of steps (most of the mass is around ra). To understand 
total variation starting from the bottom point 0, it will thus be enough to 
analyze the first time one hits ra. By [8, 29], this first hitting time is equal 
in law to the optimal strong stationary time of Corollary 4.2(a). 

Note the very different window sizes, namely, of order 1 for the L 2 and 
L°° cut-offs and of order y/n for the separation and total variation cut-off. 
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Metropolis chains. Now consider an arbitrary probability distribution v 
on {0, ...,n} with v{j) > 0, j G {(),..., re}. Use the Metropolis algorithm 
with base chain the simple symmetric random walk above to obtain a birth 
and death chain with stationary measure v (see, e.g., [20]). By construction, 
this chain satisfies the monotonicity condition p x + q x +i < 1 and Theorems 
5.1, 5.2 and 6.1 apply. 

For instance, if = a(l + j) , then the results in [20] show that there 
is no cut-off (in total variation or separation) and that A n is of order n~ 2 . 
Hence, it follows from Theorem 5.1 that t n = Yyi Ki\ mus ^ be of order n 2 , 
that is, the eigenvalues Aj must grow rapidly enough from their minimum of 
order n~ 2 . We do not know if this can be easily checked by bounding higher 
eigenvalues. For instance, it does not follow from the (rather sophisticated) 
eigenvalue bound X n ^ > ci 2 / d n~ 2 given by [34], Theorem 3.4.4, and [20]. 

As a second example, take = 2~ n ("). It is proved in [21] that, for this 
example, A n is of order 1/n and it follows from the proof that in fact is 
of order i/n. Hence, t n is of order nlogn, \ n t n — > oo and X n a n is bounded. 
By Theorems 5.1 and 5.2, there is a (i^A" 1 ) cut-off in separation (starting 
from 0). The exact asymptotic behavior of the cut-off time t n is not known. 



Bernoulli-Laplace models. Consider two urns, the left containing r red 
balls and the right containing (n — r) black balls, with < 2r < n. At each 
step, a ball is picked uniformly at random in each urn and the two balls 
are switched. The process is completely determined by the number of red 
balls in the right urn and this is a birth and death chain on {0, . . . ,r}. The 
stationary distribution is 



V n> r{j) 



(PC:;) 



and for x G {0, ... , r}, the rates are given by 

(r — x)(n — r — x) x 2 

Px = t ^ j Qx = ~, r ) r x = 1 — Px ~~ Qx ■ 

r{n — r) r{n — r) 

The eigenvalues of this chain are well known (this goes back at least to [27]; 
see, e.g., [23]) and given by (with an obvious change in notation) 

_i(n — i + 
r[n — r) 



Hence, the smallest nonzero eigenvalue is 
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and we have 



- 1 

T\Tl — V) s 

£ 7 + 



n n(l — (i — l)/n) 

r(n 



n 



-(logr + 0(l)) 



and 



e 2{ n,r ) = ^y n , r = ±(-^—) 2 = {1) . 



i(n — i + 1) , 

In both cases, the 0(1) is uniform for all r < n/2 as r tends to infinity. Given 
this data, Theorem 5.1 shows that for any sequence (r^, n^) with — > oo and 
77 < n^/2, the associated continuous-time chain has a (s^, ^-separation cut- 
off with 

s e =(l- r £ /n e )r e logr e , & = (1 - r e /n e )r e . 
If 77/7^ tends to zero, then 

^,r f =^(logr f + 7 + 0(1)) and 9 2 (rii,re) = — + o(l), 

where 7 denotes the well-known 7 constant. In this slight variation 

on Theorem 6.1 shows that the limit shape for the (se, ^-separation cut-off 
is given by the Gumbel distribution [density exp(— (x + e~ x )) on R]. 

The results above should be compared with those of [23], Theorem 2, 
where an L 2 -cut-off of type ((i,£e) 1S proved with £g as above and 

Ct = ^(1 - ri/n e )r e logn e = ( ° gn£ \ e . 
2 V 2 log 17/ 

We now describe what happens in total variation for this family. This is 
briefly discussed in [7], Section 1.5. Recall that the total variation distance 
is easily bounded by both separation and I? (see, e.g., [3, 4]). Hence, if there 
is a total variation cut-off, it is bounded above by 

that is, the minimum of the separation cut-off time and the L 2 cut-off time 
Q. It turns out that this upper bound is sharp and that there is, in fact, a 
{.Pti ^£)-cut-off in total variation. We find this phenomenon quite interesting 
and surprising. For this natural family of chains, separation and L 2 cut-off 
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times cross each other [as functions of the parameters (n, r)] and the total 
variation cut-off time is given by the minimum. 

To prove this, we only need a lower bound on total variation matching the 
upper bound provided by the separation and L 2 results. This lower bound 
can be obtained by the method introduced in [23] and used there in the 
case r = n/2. Namely, to lower bound total variation between 7* r and its 
stationary measure f njr , use a set of the form A = {4>\ < a}, where (p\ is an 
eigenfunction associated with the lowest nonzero eigenvalue A n)J . = A n . rj i. A 
complete set of eigenfunctions {<pi : i = 0, . . . , r} ((pi = (j) n) r,i associated with 
A ra ,r,i) is described in [23]. In particular, we can take 

(f)o(x) = l, (f) l (x) = l Xn 

n(n — r) 

and 

2x(n-l) (n-l)(n-2)x(x-l) 

<p 2 {x) = 1 + — — — , 

n(n — r) r{n — r)[n — r — l)(r — lj 

where x € {0, . . . , r}. Given this data, one checks that 

2 1 n 2 — Ar(n — r) n 2 (n — r — l)(r — 1) 

01 = 700 + -7 w Tytfa + —, w TT7 7^92- 

n—1 r(n — r)(n — 2) r(n — r)(n — \)(n — 2) 

This formula allows us to compute the variance of cj)\ under 7* r [the variance 
of (f)\ under the stationary measure v n ^ r is (n — 1) ]. Namely, 



Var 7 * (<f >1 ) = — — + 



1 n 2 — Ar(n — r) 



n 



1 r(n — r)(n — 2) 



e 



-tx 



_|_ ^ (jl f f)(^* f) ^ — t\ n . r ,2 g — 2t\ n ,r,l 

r(n — r)(n — l)(n — 2) 



1 n 2 — Ar(n — r 



+ —, r; 6re 



n 



1 r(n — r)(n — 2) 



r,l 



+ ( n\n-r-l){r-l) _ A^x^ 

\r(n-r)(n-l)(n-2) J 
_|_ / j g— i(2A„, r ,i — A„, r ,2) ^g— tA„, r ,2 

= _JL_ + rx 2 -4r(ra-r) c _ Unrl 
n — 1 r(n — r)(n — 2) 

r(n - r)(2 - 3ra) + ra 2 (n - 1) c _ tXn r 2 
r(n — r)(n — l)(n — 2) 

-|- (1 g~*(2A n ,r,l — A n ,r,2) ^g~ t^Xn,r,2 

< C{— h - e ~ tXn ^ H e ~ tXn - r - 2 

\n — 1 r r(n — r) 
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For the last term, we have used 1 — e~ u <u, u>0, and 

2A -A 1 



r(n — r) 

That the difference between 2A rairj i and \ n ,r,2 is small is what makes this 
proof work. 

Now we consider the two cases r 2 <n and n < r 2 < n 2 /4. In the first case 
(r 2 < n), set 

t = (log r — c) , < c < log r, 

and consider A = {<pi < a} with 2a = e~ tXn - r - 1 . Chebyshev's inequality gives 

»n,M) ~ llM) > 1 - ^ - C'(^- + le c + ^e 2c )r 2 e" 2c 

n — 1 \n — 1 ' / 

logrT 



> 1 - C" e~ c + 



This, together with the earlier separation result, proves the existence of a 
(se,&) total variation cut-off when r 2 < n 2 and tends to infinity. 
In the second case (n < r 2 < n 2 /4), set 

t = — (log n—c), 0<c<log??,, 

and consider A = {0i < a} with, again, 2a = e~ tXn ' r ' 1 . Now, Chebyshev's 
inequality gives 

t fA\^i ne ~ 2C n>( 1 1 „c , lo g™2c\_-2e 



i as t / ,n ^ - ne 1 1 . logra 9 \ 

n—1 \n — 1 r\Jn n z / 

log n 



>1-C" e 



ne 



n 

This, together with the 1? result, proves the existence of a (Ce^e) total 
variation cut-off when ti£ < rf < n 2 /A and re tends to infinity. 

The hypercube and Hamming chains. Consider the set {0, ...,n— l} r 
and the Markov chain that picks one of the r coordinates uniformly at 
random and changes this coordinate to one of n — 1 other possible values 
picked uniformly at random (the chosen coordinate cannot stay the same). 
Starting from the vector, the number of nonzero coordinates evolves as a 
birth and death chain on {0, ... ,r}, with 

(r — x) x x(n — 2) 

Px — ) Qx — / 7T) fx 



r(n — 1) ' r(n — 1) 
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The stationary distribution is 



fn,rOc)= (Z) (n-lfn r . 



This chain has eigenvalues (see, e.g., [19], Section 5, for an elementary ar- 
gument) 

in 



Hence, 



r(n — 1) 



n 



r(n — 1) ' 



_ r(n- 1) ^ 1 

nrt * n 



n ~ % 



r 1 

i(n,r) = \- 2 r a 2 nr = ^2^ 1 . 

l 1 



Fix a sequence (ng,rg) with 77 — ► 00. Then by Theorem 5.1, the associated 
continuous-time chain has a (s^, ^-separation cut-off with 

s e = (l- 1/7^)77 log r^, & = (1 - l/ne)n. 

The shape is given by the Gumbel distribution. 

As a variation, consider the birth and death chain on {0, . . . , r} with 

(7.1) p* = — , & = -0, ^ = -(1-0), 0€(O,1). 

^1 1^* ^> 

This has stationary distribution 

This is the projection (under the natural action of the symmetric group S r ) 
of the probability measure 



(i + ey 



on the hypercube {0, l} r with x = (tCt)i5 x i G {0, 1} and |x| = X) The birth 
and death chain on {0, . . . ,r} with rates (7.1) is the projection of a chain K 
on the hypercube with K(x,x) = \ x\(l — 9)/r, 
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and K(x,y) = otherwise (see, e.g., [17], Section 3). The eigenvalues are 

Xe, r ,i = -(l + e). 

r 



Hence, 

% 1 + # . r .A 1 _ 2 2 v^l 

Consider a sequence (0£,rj>) with tending to infinity. By Theorem 5.1, 
the associated continuous-time chain has a separation cut-off of type 

{(l+9 e y 1 r £ logr e ,r e ) 

[Theorem 5.1 gives a window of size (1 + #^) -1 r^, but this is essentially 
equivalent since 0£ 6 (0,1)]. This chain is studied in [17] and [18], Section 5 
(as a certain Metropolis chain on the hypercube, in discrete time). There it 
is proved that the chain has an L 2 cut-off of type 



The reference [13] also proves that for a fixed 9, there is a 

((l+0) -1 rlog yr/#,r) 

total variation cut-off as r tends to infinity. Note, however, that this last 
result cannot hold true if 6 is allowed to vary and tend to 0. In general, for a 
sequence (0£,ri) with r£ tending to infinity, there is a total variation cut-off 
of type (pi,ri) with 



pi = {l + 9 e ) r e mm{logr e ,log ^r e /6t}. 

As for the Bernoulli-Laplace models, we only need to prove a total variation 
lower bound matching the upper bound given by the separation and L 2 
results. Such a total variation lower bound is easily derived using the data 
and method of [13], page 179. 

Distance regular graphs. A finite graph is distance-transitive if the au- 
tomorphism group of that graph acts transitively on the set of vertex pairs 
(x,y) with d(x,y) = k, for any k. Distance-regular graphs generalize this 
notion without requiring a group action (see, e.g., [5, 7, 9]). Let (V,E) be 
a connected graph with vertex set V and symmetric edge set E C V x V . 
Let p be the graph distance and m be the diameter of (V,E). A graph is 
regular if the number of vertices at distance 1 from x is independent of 
x £ V. A graph is distance-regular if for any i,j € {0, . . . ,m}, the number 
of vertices at distance i from x and j from y depends only on the distance 
p(x,y) between x and y, x,y £ V. 
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One well-known basic result is that simple random walk on a distance- 
regular graph can be studied by collapsing to a birth and death chain on 
{0, ...,m} started at 0; see [7], Section 3, where more general walks on 
distance-regular graphs that can be treated in a similar fashion are discussed. 
In this collapse, the set of all eigenvalues (without multiplicities) is conserved 
and separation, total variation or LP distance to the stationary measure for 
any fixed p are conserved. Thus Theorem 5.1 yields the following result: 

Theorem 7.1. Let Q n be a family of distance-regular graphs with diam- 
eter m n tending to infinity as n tends to infinity. On the vertex set V n of 
Q n , consider the continuous-time simple random walk and let 7* x be the 
law of that process started at a fixed arbitrary point x n G V n . Let v n be the 
uniform probability measure on (V n ). Let X n be the associated spectral gap 
and for any fixed e 6 (0, 1), set 

r n = inf{t > : sep(7* ia . n , u n ) < e}. 

Then the family (^,7* x , u n ) has a separation cut-off if and only if \ n T n — > 

CO. 

Remarks. 1. The separation sep(7^ x , v n ) does not depend on the 
starting point x n , so this result can be read as a max-separation result. 

2. Note that the family Q n does not need to be "natural" in any way. It 
can mix elements from the various natural families described below. 

Theorem 5.1 leads to the computation of the cut-off time, when it exists, 
in terms of the spectrum. It is conjectured by experts that distance-regular 
graphs have been classified. They include the following examples: 

(i) The finite circle Z/nZ with an edge from x to y if and only if \x — y\ = 
1. This family has no cut-off. 

(ii) Hamming distance graphs such as the hypercube. These have been 
discussed above. 

(iii) The natural graph on r-sets of an n-set with an edge from x to 
y if Pi y) = k — 1. This is equivalent to the Bernoulli-Laplace models 
discussed above. 

(iv) (/-Families: These are described in some detail in [7] with data that 
is useful for our purpose. These families are all related to certain types of 
vector-subspaces of a finite-dimensional vector space over a finite field ¥ q 
{q a prime power), hence the name. The simplest example is the set of all 
m-dimensional vector subspaces of an n-dimensional vector space discussed 
below. 

Regarding the known distance-regular graphs from the g-Families (g-DRG 
for short) listed in [7], we can state the following theorem: 
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Theorem 7.2. Referring to the setting of Theorem 7.1, assume that the 
family Q n is made of some of the known q-DRGs listed in [7]. Then there is 
a separation cut-off if and only if the diameter m n of Q n tends to infinity. 
Moreover, if m n tends to infinity, then there is a (m n , ^/m n ) separation 
cut-off with a nondegenerate normal shape. 

This immediately follows from the data reviewed in [7] and from Theorem 
5.1. Instead of going into the details, we will illustrate the result with the 
simplest case of a natural g-DRG family. 

Fixed- dimension subspaces of a q vector space. Let q be a prime power 
and F q be a finite field of order q. Let E n be an n-dimensional vector space 
over F q . For m < n/2, let V q , n ,m be the set of all m-dimensional vector 
subspaces of E n . The finite set V q , n ,m is equipped with the distance 

d{x, y) =m — dim(x H y) 

and the graph structure according to which (x,y) is an edge if and only 
if d(x,y) = 1. The induced graph distance is the distance d and this graph 
has diameter m < n/2. The action of GL n (F q ) on vector subspaces shows 
that this is a distance-transitive graph, hence a distance-regular graph. We 
consider the simple random walk on this graph (started at an arbitrary 
subspace) in continuous time. Let 7q irijm be its law at time t and v q n m 
its (uniform) stationary measure. Details (dealing with the discrete-time 
version) can be found in [7, 13]. As explained in [7, 13] and briefly above, this 
process can be studied through a birth and death chain on {0, . . . , m} which 
is simply the associated "distance process." Known computations involving 
relevant families of orthogonal polynomials give the eigenvalues \ q , n ,m,i of 
this chain as 

(l-g-*)(l-^-»-l) 
<\q,n,m,i — Q - g m " n )(l - q~ m ) ' % ^ 

Hence the smallest nonzero eigenvalue is 

(l_ g -l)(l_ g -n) 



(1 - q m - n )(l - q~ m ) 
and when m (hence also n) tends to infinity, we have 

m m 

E Kkm,i = ™ + O(l), E Ksi,m,i = m + 0(1)" 

i=l i=l 

If we now choose an arbitrary sequence (qi,ni,mi) and consider the family 
(n^jj,^) where 
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then Theorem 5.1 shows that this family has a separation cut-off if and only 
if mi tends to infinity. Assuming that mi tends to infinity, Theorem 6.1 
shows that there is an (mg, yjmg )-separation cut-off and that the window 
size y/mi is optimal. 

The references [7, 13], when translated into continuous time (in the present 
case, there are significant differences between discrete and continuous time), 
give a total variation cut-off of type (mg,y/mg) and an L 2 cut-off of type 
(sg, 1) with 

sg = \mg{nt-mg) log©. 
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